Figure 1: Top 10 BLASTP hits for INSP179 polypeptide sequence (SEQ ID NO:10)
against NCBI-nr

BLASTP 2.2.2 [Jan-08-2002]



Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= INSP179.pp

(880 letters) 

Database: All non-redundant GenBank CDS 
translations+PDB+SwissProt+PIR+PRF 

1,594,288 sequences; 522,190,286 total letters 



Searching done 

Sequences producing significant alignments: 

ref|XP_291099.2| similar to hypothetical protein MGC38937 [Homo
ref|NP_705796.1| hypothetical protein MGC38937 [Mus musculus] >g...
ref|XP_223356.1| similar to hypothetical protein MGC38937 [Rattu...
ref|XP_221358.2| similar to Carboxypeptidase N 83 kDa chain (Car...
ref|NP_570843.1| leucine rich repeat containing 15; leucine-rich ...
gb|AAH25836.1| 1300018K11Rik protein [Mus musculus]
ref|XP_148373.1| RIKEN cDNA 1300018K11 [Mus musculus] >gi|380806...
ref|NP_659551.1| leucine-rich repeat protein induced by beta-amy...
dbj|BAD01045.1| toll-like receptor 3 [Paralichthys olivaceus]
dbj|BAB85498.1| 18 wheeler [Bombyx mori]
Figure 2: Pairwise alignment of INSP179 to top annotated BLASTP hit (mouse
ortholog)

>ref|NP_705796.1| hypothetical protein MGC38937 [Mus musculus]
gb|AAH31901.1| Hypothetical protein MGC38937 [Mus musculus]
dbj|BAC34797.1| unnamed protein product [Mus musculus]
Length = 872 

Score = 684 bits (1765), Expect = 0.0

Identities = 422/888 (47%), Positives = 529/888 (59%), Gaps = 41/888 (4%) 

Query: 1   MKNLYFRVITIVIGLYFTGIMTNASRKSNILFNSECQWNEYILTNCSFTGKCDIPVDISQ 60
           M++ Y RV  +V GL F   +T  SRKS++ FN E Q N  +L N S          +SQ
Sbjct: 1   MRDFYVRVTILVTGLCFVETVTTPSRKSSVSFNPEYQRNGDLLVNWSSIRH------VSQ 54

Query: 61  TAATVDVSFNFFRVLLQSHTKKEEWKIKHLDLSNNLISKITLSPFAYLHALEVLNLSNNA 120
               +D SF FFRVL Q HT+KE   IK  D +++ ISK+TL P A+LHALE+LNLSN A
Sbjct: 55  NTDAMDRSFYFFRVLFQPHTQKER-HIKPPDRTHHRISKVTLDPLAHLHALEILNLSNKA 113

Query: 121 IHSLSLDLLSPKSSWVKRHRSSFRNRFPLLKVLILQRNKLSDTPKGLWKLKSLQSLDLSF 180
           IH  SLD   P SS  KRH     +R P L+VLILQRN+LS TPKGLWKLKSL+SLDLSF
Sbjct: 114 IHYFSLDQPLPPSSHQKRHGGHSHSRLPRLQVLILQRNQLSGTPKGLWKLKSLRSLDLSF 173

Query: 181 NGILQIGWSDFHNCLQLENLCLKSNKIFKIPPQAFKDLKKLQVIDLSNNALITILPMMII 240
           N I+ IG SDFH CLQLE++ LKSNKI  I P+AFK LKKLQV+DL +NAL T++P++ I
Sbjct: 174 NRIVHIGLSDFHGCLQLESIYLKSNKICTIHPKAFKGLKKLQVVDLRSNALTTLVPIVTI 233

Query: 241 ALEFPHLVVDLADNNWQCDDSVAVFQNFISESWRKKWNVICNRSIGSEEANGGTPQSRIS 300
           ALE PHL + LADN WQC +S   FQN  S SWR+ W  ICN S+ ++  N  T Q R S
Sbjct: 234 ALELPHLELGLADNQWQCSESNVNFQNITSSSWREIWKAICNMSVENKRPNAETHQIRKS 293

Query: 301 RETRL---PPIHLHRMKSLIRSKAERPQGGRHTGISTLGKKAKAGSG-LRKKQRRLPRSV 356
           R+T L   PP  L   KSLI+SKAERPQ G    +S LGK+AK G G LR    + P  +
Sbjct: 294 RDTHLLLSPPSDL---KSLIQSKAERPQAGMDMHLSALGKEAKDGYGDLRGMWPQSPVEL 350

Query: 357 RSTRDVQAAGKKEDAPQDLALAVCLSVFITFLVAFSLGAFTRPYVDRLWQKKCQSKSPGL 416
           R ++D Q   +K+D P  L LA+CLSVFITF+VAF LGAF RPY+DRL Q++C +K PG
Sbjct: 351 RDSQDEQVTDRKDDKPPALELAICLSVFITFVVAFCLGAFARPYIDRLRQQRCSNKRPGS 410

Query: 417 DNAYSNEGFYDDMEAAGHTPHPETHLRQVFPHLSLYENQTPFWVTQPHPHATVIPDRTLG 476
           DNAYSN+GF+ D+E A H  +  T L Q   HL L ENQ P WV +P PH+ V  ++ LG
Sbjct: 411 DNAYSNKGFHGDIEGAQHMEYQGTDLHQTTHHLHLSENQNPSWVAEPIPHSAVQSEQMLG 470

Query: 477 RSRKDPGSSQSPGQCGDNTGAGSGND------GAVYSILQRHPHAGNRELMSAAQDHIHR 530
            +  DPG  QSP Q  D+  + SG+          +  L   P+A   + +S  Q   H
Sbjct: 471 SN6TDPGHQQSPEQLKDSNKSRSGDSIVLPSGPVAHLALHGLPNADAHKAISPVQ---HH 527

Query: 531 NDILGEWTYETVAQEEPLSAHSVGVSSVAGTSHAVSGSSRYDSNELDPSLSGEITASLCK 590
           +D L E  Y+TVAQE  L    +  SS+ G       S     ++L PS   ++ AS  K
Sbjct: 528 HDFLEEAHYDTVAQEYSLIDDVMDRSSITGPLGTFPSSVESRRDDLHPSQPRDVVASFSK 587

Query: 591 MLTHAEAQRTGDSKERGGTEQ-SLWDSQMEFSKERQVSSSIDLLSIQQPRLSGARAEEAL 649
            L HA  +    S E G  E     DSQM  S+ERQVS+SI  L+ QQP   G  AEE L
Sbjct: 588 TLAHANTREAEGSMETGCPEPLGAMDSQMGSSEERQVSNSIRELATQQPSFQGVDAEERL 647

Query: 650 SAHYSEVPYGDPRDTGPSVFPPRWDSGLDVTPANKEPVQKSTPSDTCCELES--DCDSDE 707
           S  YSEV + DP    PS+  PRW SG  V PA  EPV++  P D   +L +  + DSDE
Sbjct: 648 SHVYSEVLHNDP----PSL-RPRWGSGHYVIPATGEPVERDAPFDPHYDLVTNYESDSDE 702

Query: 708 GSLFTLSSISSESARSKTEEAVPDE-----ESLQDESSGASKDNVTAVDSLEENVTFQTI 762
           GSLFTLSS  SE  RS  EE    E     + L   + G  KD+VT+ +S+E+ +T Q I
Sbjct: 703 GSLFTLSSEGSEDTRSLAEEQASVENDGTSQPLPSRNLGEYKDSVTSAESVED-LTSQRI 761

Query: 763 PGKCKNQEDPFEKPLISAPDSGMYKTHLENASDTDRSEGLSPWPRSPGNSPLGDEFPGMF 822
           P KC+ QE      LIS PDS + +T+ EN S +   E  S WP+ PG+     E  G +
Sbjct: 762 PEKCEAQEAHLRNTLISGPDSCVCETNQENDSSSLDPENRSTWPQLPGHKLSHHETLGTY 821
Query: 823 TYDYDTALQSKAAEWHCSLRDLEFSNVDVLQQTPPCSAEVPSDPDKAA 870
               D   QS+A +WH SLRDLE  NVD    +PP S E  S P+  A
Sbjct: 822 G---DIEPQSEAVDWHYSLRDLESPNVD-SSPSPPYSDEDLSGPEDRA 865
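The summary line of the Figure 2 alignment (Identities = 422/888, Positives = 529/888, Gaps = 41/888) can be cross-checked against the coordinate ranges shown (Query 1-870, Sbjct 1-865). The following is a minimal sketch of that arithmetic, not part of the BLASTP report; the function names are hypothetical, and the truncating division is an assumption that happens to reproduce the 47%, 59% and 4% shown.

```python
def blast_percent(count: int, alignment_length: int) -> int:
    """Whole-number percentage, truncated rather than rounded (assumption)."""
    return (100 * count) // alignment_length


def implied_gaps(alignment_length: int, query_residues: int, sbjct_residues: int) -> int:
    """Every alignment column holds a residue or a gap in each row, so the
    gap total is the shortfall of both rows against the alignment length."""
    return (alignment_length - query_residues) + (alignment_length - sbjct_residues)


ALIGNMENT_LENGTH = 888   # denominator reported in Figure 2
QUERY_RESIDUES = 870     # Query: 1..870
SBJCT_RESIDUES = 865     # Sbjct: 1..865

for label, count in (("Identities", 422), ("Positives", 529), ("Gaps", 41)):
    pct = blast_percent(count, ALIGNMENT_LENGTH)
    print(f"{label} = {count}/{ALIGNMENT_LENGTH} ({pct}%)")

print("gaps implied by coordinates:",
      implied_gaps(ALIGNMENT_LENGTH, QUERY_RESIDUES, SBJCT_RESIDUES))
```

The implied gap total (18 columns in the query row plus 23 in the subject row) agrees with the 41 gaps reported.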



WO 2005/080425 



4/15 



PCT/GB2005/000514 



Figure 3: Domain Professor results for INSP179 
Query: INSP179.pp 
Figure 4: Signal peptide prediction (SignalP V2.0) for INSP179 polypeptide sequence
(SEQ ID NO: 10).
Figure 5: INSP179 DNA and polypeptide sequence

1 ttccgaaatc ttaaattgaa aattaaattt gctgcttatt gctgttagtt tttatatatt

61 attgtgaata agtcaattca ttttcctttt tccaggcatt atgaaaaacc tctatttcag
mkn lyf
INSP179-CP1

121 agtcattacc atagttatag gtctttattt tactggaata atgacaaatg catcaagaaa
rvit ivi gly ftgi mtn asr

181 aagcaatatt ttattcaatt ctgaatgcca atggaatgaa tatattctga caaattgttc
ksni lfn sec qwne yil tnc

241 ttttaccgga aagtgtgata tacctgtgga catatcacag acagcagcca ctgtggatgt
sftg kcd ipv disq taa tvd

301 aagtttcaat ttctttagag ttctcttaca gtctcacacg aaaaaagaag agtggaaaat
vsfn ffr vll qsht kke ewk

361 aaaacatctg gacctcagta acaatctcat atcaaaaata accttaagcc cttttgcata
ikhl dls nnl iski tls pfa

421 tttacatgct ttggaagtgt taaacctcag caacaatgcc atccactccc tctcattgga
ylha lev lnl snna ihs lsl

481 tctactcagt cctaagtcct catgggtgaa acgccacaga agcagcttca gaaacaggtt
dlls pks swv krhr ssf rnr

541 tccattgctg aaggtgctca ttcttcaaag aaataaactc agtgacactc ccaagggact
fpll kvl ilq rnkl sdt pkg

601 gtggaaactg aagtcattgc agagtttgga tctgtcattc aatgggatat tgcaaatagg
lwkl ksl qsl dlsf ngi lqi

661 gtggtctgat tttcacaact gcctgcaact ggagaatctc tgtttaaaga gcaacaagat
gwsd fhn clq lenl clk snk

721 attcaaaatt cccccacaag ccttcaagga cctcaaaaaa ttacaggtca tagaccttag
ifki ppq afk dlkk lqv idl

781 caacaatgct ctgattacca tcctaccaat gatgatcata gctctagaat ttccccatct
snna lit ilp mmii ale fph

841 agtggttgac ttggctgata ataactggca gtgtgatgat agtgtggcag tctttcaaaa
lvvd lad nnw qcdd sva vfq

901 ttttatttct gaatcctgga ggaaaaagtg gaatgtcatt tgcaacaggt ctatagggag
nfis esw rkk wnvi cnr sig

961 tgaggaggcc aacgggggca ctccccagag caggatttcc agggaaaccc gccttcctcc
seea ngg tpq sris ret rlp

1021 cattcatctg catcgcatga aaagcctcat aaggagcaaa gcagagaggc cccagggagg
pihl hrm ksl irsk aer pqg

1081 aaggcacacg ggcatttcta ctctggggaa gaaggcaaag gccggctctg gtctcaggaa
grht gis tlg kkak ags glr
1141 gaagcagaga cggctgccaa ggagtgttag aagcacccgc gatgtgcagg ctgccggcaa
kkqr rlp rsv rstr dvq aag
INSP179-CP2

1201 aaaagaggac gctccccagg acctggctct ggcggtgtgc ctgtcagtgt tcatcacatt
kked apq dla lavc lsv fit

1261 ccttgtcgcc ttcagcctgg gggctttcac aaggccttat gttgacagac tgtggcaaaa
flva fsl gaf trpy vdr lwq

1321 aaagtgccag agcaaaagcc ctggcctgga caacgcgtat tcaaacgagg gcttctacga 
kkcq sks pgl dnay sne gfy 

1381 tgacatggaa gctgcggggc acacaccaca cccagagacc catctgcgcc aagtatttcc 
ddme aag htp hpet hlr qvf 

1441 tcatctaagc ctctacgaga accagacccc tttctgggtg acacagccac acccacacgc 
phls lye nqt pfwv tqp hph

1501 caccgtaatt cctgatagaa ctctgggaag gagcagaaag gatcctggca gttcgcagag 
atvi pdr tlg rsrk dpg ssq

1561 cccaggacag tgcggggaca acaccggggc aggaagtgga aatgatggtg cagtctattc 
spgq cgd ntg agsg ndg avy 

1621 cattctccag agacatccac atgccggtaa ccgtgaacta atgtcagcag cgcaggacca 
silq rhp hag nrel msa aqd 

1681 catccatagg aatgatattc tcggagaatg gacttatgaa actgtggccc aggaagagcc 
hihr ndi lge wtye tva qee 

1741 tctcagtgca cattcagtgg gcgtctcttc tgtagctggc acgtctcacg ctgtctctgg 
plsa hsv gvs svag tsh avs 

1801 ctcaagccgt tatgattcca atgaattaga cccttccctc tccggagaaa taacagcttc 
gssr yds nel dpsl sge ita

1861 cctctgtaaa atgctaacac atgcagaagc acagaggact ggagatagta aggaaagagg
slck mlt hae aqrt gds ker

1921 gggcactgaa cagtcacttt gggactcgca gatggaattt tctaaggaaa ggcaagtgag 
ggte qsl wds qmef ske rqv 

1981 ttcatccatt gatttgctga gcatacagca gccaaggctg tccggggcaa gggctgagga 
sssi dll siq qprl sga rae 

2041 agcgctttca gcccactaca gcgaggttcc atacggtgac ccaagagaca caggcccatc 
eals ahy sev pygd prd tgp 

2101 agtctttcct ccaagatggg acagtggcct ggatgtcact cctgctaaca aggaaccagt 
svfp prw dsg ldvt pan kep

2161 gcagaaatcc actccttctg acacttgctg tgagttggag agtgactgtg actctgatga 
vqks tps dtc cele sdc dsd 

2221 ggggtctctg ttcactctga gctccataag ttcagagagt gcaaggagca agactgaaga 
egsl ftl ssi sses ars kte 

2281 ggcagtgcct gatgaggagt ccctgcagga cgagagctca ggggcaagca aggacaatgt 
eavp dee slq dess gas kdn 
2341 gacggctgta gacagtcttg aggaaaatgt taccttccaa acaattccag ggaaatgcaa
vtav dsl een vtfq tip gkc

2401 gaatcaagaa gatccctttg aaaaacctct catttctgct ccagactctg gcatgtacaa
knqe dpf ekp lisa pds gmy

2461 gactcatctg gaaaatgcct ctgacactga tagatctgag ggcctgtcac cctggcccag
kthl ena sdt drse gls pwp

2521 gtcaccaggg aatagtccct taggggatga gtttccgggc atgttcactt atgattatga
rspg nsp lgd efpg mft ydy

2581 cacagctctt caatccaagg cagcagaatg gcattgctca cttagagact tagaattttc
dtal qsk aae whcs lrd lef

2641 aaatgtggac gttttacagc aaacaccacc atgttctgct gaagttccct cagatcctga
snvd vlq qtp pcsa evp sdp

2701 taaggctgcc ttccatgaaa gagactcaga cattttaaaa taagaatctt tcattaagga
dkaa fhe rds dilk

2761 aatattcaca gctctaaaca atattctttt aaagatcatg gcaggggaaa actaaagcct
2821 tcacaacaag atcctgaagg agacaacatg aattccaacc aagtggacac tgatgcaaat
2881 gagggttttg tgggcccact tgagggcgat gatt



Position and sense of PCR primers 



-► 
Figure 6: INSP179-EC DNA and polypeptide sequence

1 atgaaaaacc tctatttcag agtcattacc atagttatag gtctttattt tactggaata
mkn lyf rvit ivi gly ftgi
► INSP179-CP1

61 atgacaaatg catcaagaaa aagcaatatt ttattcaatt ctgaatgcca atggaatgaa
mtn asr ksni lfn sec qwne

121 tatattctga caaattgttc ttttaccgga aagtgtgata tacctgtgga catatcacag
yil tnc sftg kcd ipv disq

181 acagcagcca ctgtggatgt aagtttcaat ttctttagag ttctcttaca gtctcacacg
taa tvd vsfn ffr vll qsht

241 aaaaaagaag agtggaaaat aaaacatctg gacctcagta acaatctcat atcaaaaata
kke ewk ikhl dls nnl iski

301 accttaagcc cttttgcata tttacatgct ttggaagtgt taaacctcag caacaatgcc
tls pfa ylha lev lnl snna

361 atccactccc tctcattgga tctactcagt cctaagtcct catgggtgaa acgccacaga
ihs lsl dlls pks swv krhr

421 agcagcttca gaaacaggtt tccattgctg aaggtgctca ttcttcaaag aaataaactc
ssf rnr fpll kvl ilq rnkl

481 agtgacactc ccaagggact gtggaaactg aagtcattgc agagtttgga tctgtcattc
sdt pkg lwkl ksl qsl dlsf

541 aatgggatat tgcaaatagg gtggtctgat tttcacaact gcctgcaact ggagaatctc
ngi lqi gwsd fhn clq lenl

601 tgtttaaaga gcaacaagat attcaaaatt cccccacaag ccttcaagga cctcaaaaaa
clk snk ifki ppq afk dlkk

661 ttacaggtca tagaccttag caacaatgct ctgattacca tcctaccaat gatgatcata
lqv idl snna lit ilp mmii

721 gctctagaat ttccccatct agtggttgac ttggctgata ataactggca gtgtgatgat
ale fph lvvd lad nnw qcdd

781 agtgtggcag tctttcaaaa ttttatttct gaatcctgga ggaaaaagtg gaatgtcatt
sva vfq nfis esw rkk wnvi

841 tgcaacaggt ctatagggag tgaggaggcc aacgggggca ctccccagag caggatttcc
cnr sig seea ngg tpq sris

901 agggaaaccc gccttcctcc cattcatctg catcgcatga aaagcctcat aaggagcaaa
ret rlp pihl hrm ksl irsk

961 gcagagaggc cccagggagg aaggcacacg ggcatttcta ctctggggaa gaaggcaaag
aer pqg grht gis tlg kkak

1021 gccggctctg gtctcaggaa gaagcagaga cggctgccaa ggagtgttag aagcacccgc
ags glr kkqr rlp rsv rstr

1081 gatgtgcag INSP179-CP2
dvq



Position and sense of PCR primers 
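The lower-case letters under each row of Figure 6 are the one-letter translation of the nucleotide row above them. A minimal sketch of that translation follows, assuming the standard genetic code and reading frame 1 from the initial atg; the `translate` helper and table names are illustrative and are not taken from the application.

```python
# Standard genetic code packed in the usual T, C, A, G base order:
# the amino acid for codon b1 b2 b3 sits at index 16*i1 + 4*i2 + i3.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; whitespace between the
    ten-base groups is ignored and a trailing partial codon is dropped."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last rows of Figure 6 (positions 1-60 and 1081-1089).
first_row = "atgaaaaacc tctatttcag agtcattacc atagttatag gtctttattt tactggaata"
last_row = "gatgtgcag"

print(translate(first_row).lower())
print(translate(last_row).lower())
```

Run on the first row, the sketch yields the same twenty residues printed beneath it in the figure (mkn lyf rvit ivi gly ftgi, without the spacing).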
Figure 7: INSP179-EC-SV1 DNA and polypeptide sequence

1 atgaaaaacc tctatttcag agtcattacc atagttatag gtctttattt tactggaata
mkn lyf rvit ivi gly ftgi
INSP179-CP1

61 atgacaaatg catcaagaaa aagcaatatt ttattcaatt ctgaatgcca atggaatgaa
mtn asr ksni lfn sec qwne

121 tatattctga caaattgttc ttttaccgga aagtgtgata tacctgtgga catatcacag
yil tnc sftg kcd ipv disq

181 acagcagcca ctgtggatgt aagtttcaat ttctttagag ttctcttaca gtctcacacg
taa tvd vsfn ffr vll qsht

241 aaaaaagaag agtggaaaat aaaacatctg gacctcagta acaatctcat atcaaaaata
kke ewk ikhl dls nnl iski

301 accttaagcc cttttgcata tttacatgct ttggaagtgt taaacctcag caacaatgcc
tls pfa ylha lev lnl snna

361 atccactccc tctcattgga tctactcagt cctaagtcct catgggtgaa acgccacaga
ihs lsl dlls pks swv krhr

421 agcagcttca gaaacaggtt tccattgctg aaggtgctca ttcttcaaag aaataaactc
ssf rnr fpll kvl ilq rnkl

481 agtgacactc ccaaggggag tgaggaggcc aacgggggca ctccccagag caggatttcc
sdt pkg seea ngg tpq sris

541 agggaaaccc gccttcctcc cattcatctg catcgcatga aaagcctcat aaggagcaaa
ret rlp pihl hrm ksl irsk

601 gcagagaggc cccagggagg aaggcacacg ggcatttcta ctctggggaa gaaggcaaag
aer pqg grht gis tlg kkak

661 gccggctctg gtctcaggaa gaagcagaga cggctgccaa ggagtgttag aagcacccgc
ags glr kkqr rlp rsv rstr

721 gatgtgcag INSP179-CP2
dvq
Figure 8: Alignment of INSP179-EC and INSP179-EC-SV1 nucleotide sequences

179EC-SV1   ATGAAAAACCTCTATTTCAGAGTCATTACCATAGTTATAGGTCTTTATTTTACTGGAATA
INSP179EC   ATGAAAAACCTCTATTTCAGAGTCATTACCATAGTTATAGGTCTTTATTTTACTGGAATA

179EC-SV1   ATGACAAATGCATCAAGAAAAAGCAATATTTTATTCAATTCTGAATGCCAATGGAATGAA
INSP179EC   ATGACAAATGCATCAAGAAAAAGCAATATTTTATTCAATTCTGAATGCCAATGGAATGAA

179EC-SV1   TATATTCTGACAAATTGTTCTTTTACCGGAAAGTGTGATATACCTGTGGACATATCACAG
INSP179EC   TATATTCTGACAAATTGTTCTTTTACCGGAAAGTGTGATATACCTGTGGACATATCACAG

179EC-SV1   ACAGCAGCCACTGTGGATGTAAGTTTCAATTTCTTTAGAGTTCTCTTACAGTCTCACACG
INSP179EC   ACAGCAGCCACTGTGGATGTAAGTTTCAATTTCTTTAGAGTTCTCTTACAGTCTCACACG

179EC-SV1   AAAAAAGAAGAGTGGAAAATAAAACATCTGGACCTCAGTAACAATCTCATATCAAAAATA
INSP179EC   AAAAAAGAAGAGTGGAAAATAAAACATCTGGACCTCAGTAACAATCTCATATCAAAAATA

179EC-SV1   ACCTTAAGCCCTTTTGCATATTTACATGCTTTGGAAGTGTTAAACCTCAGCAACAATGCC
INSP179EC   ACCTTAAGCCCTTTTGCATATTTACATGCTTTGGAAGTGTTAAACCTCAGCAACAATGCC

179EC-SV1   ATCCACTCCCTCTCATTGGATCTACTCAGTCCTAAGTCCTCATGGGTGAAACGCCACAGA
INSP179EC   ATCCACTCCCTCTCATTGGATCTACTCAGTCCTAAGTCCTCATGGGTGAAACGCCACAGA

179EC-SV1   AGCAGCTTCAGAAACAGGTTTCCATTGCTGAAGGTGCTCATTCTTCAAAGAAATAAACTC
INSP179EC   AGCAGCTTCAGAAACAGGTTTCCATTGCTGAAGGTGCTCATTCTTCAAAGAAATAAACTC

179EC-SV1   AGTGACACTCCCAAGGG-------------------------------------------
INSP179EC   AGTGACACTCCCAAGGGACTGTGGAAACTGAAGTCATTGCAGAGTTTGGATCTGTCATTC

179EC-SV1   ------------------------------------------------------------
INSP179EC   AATGGGATATTGCAAATAGGGTGGTCTGATTTTCACAACTGCCTGCAACTGGAGAATCTC

179EC-SV1   ------------------------------------------------------------
INSP179EC   TGTTTAAAGAGCAACAAGATATTCAAAATTCCCCCACAAGCCTTCAAGGACCTCAAAAAA

179EC-SV1   ------------------------------------------------------------
INSP179EC   TTACAGGTCATAGACCTTAGCAACAATGCTCTGATTACCATCCTACCAATGATGATCATA

179EC-SV1   ------------------------------------------------------------
INSP179EC   GCTCTAGAATTTCCCCATCTAGTGGTTGACTTGGCTGATAATAACTGGCAGTGTGATGAT

179EC-SV1   ------------------------------------------------------------
INSP179EC   AGTGTGGCAGTCTTTCAAAATTTTATTTCTGAATCCTGGAGGAAAAAGTGGAATGTCATT

179EC-SV1   -----------------GAGTGAGGAGGCCAACGGGGGCACTCCCCAGAGCAGGATTTCC
INSP179EC   TGCAACAGGTCTATAGGGAGTGAGGAGGCCAACGGGGGCACTCCCCAGAGCAGGATTTCC

179EC-SV1   AGGGAAACCCGCCTTCCTCCCATTCATCTGCATCGCATGAAAAGCCTCATAAGGAGCAAA
INSP179EC   AGGGAAACCCGCCTTCCTCCCATTCATCTGCATCGCATGAAAAGCCTCATAAGGAGCAAA

179EC-SV1   GCAGAGAGGCCCCAGGGAGGAAGGCACACGGGCATTTCTACTCTGGGGAAGAAGGCAAAG
INSP179EC   GCAGAGAGGCCCCAGGGAGGAAGGCACACGGGCATTTCTACTCTGGGGAAGAAGGCAAAG

179EC-SV1   GCCGGCTCTGGTCTCAGGAAGAAGCAGAGACGGCTGCCAAGGAGTGTTAGAAGCACCCGC
INSP179EC   GCCGGCTCTGGTCTCAGGAAGAAGCAGAGACGGCTGCCAAGGAGTGTTAGAAGCACCCGC

179EC-SV1   GATGTGCAG
INSP179EC   GATGTGCAG
Figure 9: Alignment of predicted INSP179 and the cloned INSP179-EC and INSP179-
EC-SV1 polypeptide sequences

INSP179-EC_PREDICTION   MKNLYFRVITIVIGLYFTGIMTNASRKSNILFNSECQWNEYILTNCSFTG
INSP179-EC_CLONED       MKNLYFRVITIVIGLYFTGIMTNASRKSNILFNSECQWNEYILTNCSFTG
INSP179-EC-SV1_CLONED   MKNLYFRVITIVIGLYFTGIMTNASRKSNILFNSECQWNEYILTNCSFTG
                        **************************************************

INSP179-EC_PREDICTION   KCDIPVDISQTAATVDVSFNFFRVLLQSHTKKEEWKIKHLDLSNNLISKI
INSP179-EC_CLONED       KCDIPVDISQTAATVDVSFNFFRVLLQSHTKKEEWKIKHLDLSNNLISKI
INSP179-EC-SV1_CLONED   KCDIPVDISQTAATVDVSFNFFRVLLQSHTKKEEWKIKHLDLSNNLISKI
                        **************************************************

INSP179-EC_PREDICTION   TLSPFAYLHALEVLNLSNNAIHSLSLDLLSPKSSWVKRHRSSFRNRFPLL
INSP179-EC_CLONED       TLSPFAYLHALEVLNLSNNAIHSLSLDLLSPKSSWVKRHRSSFRNRFPLL
INSP179-EC-SV1_CLONED   TLSPFAYLHALEVLNLSNNAIHSLSLDLLSPKSSWVKRHRSSFRNRFPLL
                        **************************************************

INSP179-EC_PREDICTION   KVLILQRNKLSDTPKGLWKLKSLQSLDLSFNGILQIGWSDFHNCLQLENL
INSP179-EC_CLONED       KVLILQRNKLSDTPKGLWKLKSLQSLDLSFNGILQIGWSDFHNCLQLENL
INSP179-EC-SV1_CLONED   KVLILQRNKLSDTPK-----------------------------------
                        ***************

INSP179-EC_PREDICTION   CLKSNKIFKIPPQAFKDLKKLQVIDLSNNALITILPMMIIALEFPHLVVD
INSP179-EC_CLONED       CLKSNKIFKIPPQAFKDLKKLQVIDLSNNALITILPMMIIALEFPHLVVD
INSP179-EC-SV1_CLONED   --------------------------------------------------

INSP179-EC_PREDICTION   LADNNWQCDDSVAVFQNFISESWRKKWNVICNRSIGSEEANGGTPQSRIS
INSP179-EC_CLONED       LADNNWQCDDSVAVFQNFISESWRKKWNVICNRSIGSEEANGGTPQSRIS
INSP179-EC-SV1_CLONED   -----------------------------------GSEEANGGTPQSRIS
                                                           ***************

INSP179-EC_PREDICTION   RETRLPPIHLHRMKSLIRSKAERPQGGRHTGISTLGKKAKAGSGLRKKQR
INSP179-EC_CLONED       RETRLPPIHLHRMKSLIRSKAERPQGGRHTGISTLGKKAKAGSGLRKKQR
INSP179-EC-SV1_CLONED   RETRLPPIHLHRMKSLIRSKAERPQGGRHTGISTLGKKAKAGSGLRKKQR
                        **************************************************

INSP179-EC_PREDICTION   RLPRSVRSTRDVQ
INSP179-EC_CLONED       RLPRSVRSTRDVQ
INSP179-EC-SV1_CLONED   RLPRSVRSTRDVQ
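The Figure 9 alignment shows INSP179-EC-SV1 matching INSP179-EC except for one internal stretch that is absent from the variant. As a hedged illustration of how such a segment can be located, the sketch below compares the three 50-residue rows that span the gap (residues 151-300 by the figure's row layout) using a longest common prefix and suffix. The function name and the offset constant are assumptions made for this example; the method is not stated in the application.

```python
# Rows 4-6 of Figure 9 for the cloned INSP179-EC sequence (residues 151-300).
EC_FRAGMENT = (
    "KVLILQRNKLSDTPKGLWKLKSLQSLDLSFNGILQIGWSDFHNCLQLENL"
    "CLKSNKIFKIPPQAFKDLKKLQVIDLSNNALITILPMMIIALEFPHLVVD"
    "LADNNWQCDDSVAVFQNFISESWRKKWNVICNRSIGSEEANGGTPQSRIS"
)
# The same rows for INSP179-EC-SV1 with the gap characters removed.
SV1_FRAGMENT = "KVLILQRNKLSDTPK" + "GSEEANGGTPQSRIS"
FRAGMENT_OFFSET = 150  # residues preceding the fragment (three rows of 50)


def find_deletion(full: str, variant: str) -> tuple[int, int]:
    """Return (start, end) such that full[start:end] is missing from variant,
    assuming variant equals full with one contiguous segment removed."""
    prefix = 0
    while prefix < len(variant) and full[prefix] == variant[prefix]:
        prefix += 1
    suffix = 0
    # Cap the suffix so it never overlaps the prefix already matched.
    while suffix < len(variant) - prefix and full[-1 - suffix] == variant[-1 - suffix]:
        suffix += 1
    return prefix, len(full) - suffix


start, end = find_deletion(EC_FRAGMENT, SV1_FRAGMENT)
deleted = EC_FRAGMENT[start:end]
print(f"deleted residues {FRAGMENT_OFFSET + start + 1}-{FRAGMENT_OFFSET + end}: "
      f"{len(deleted)} aa, {deleted[:4]}...{deleted[-5:]}")
```

Because a glycine flanks the missing segment on both sides, the boundary can be drawn one residue to either side without changing the result; the prefix-first scan here assigns the shared G to the left flank, whereas the figure draws it on the right. The 120 residues found correspond to the 360-nucleotide difference between the 1089-base sequence of Figure 6 and the 729-base sequence of Figure 7.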
Figure 10: Predicted INSP179 glycosylation sites



NetNGlyc 1.0: predicted N-glycosylation sites in Sequence
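Candidate N-glycosylation sites are conventionally sought at N-X-S/T sequons, where X is any residue except proline. The sketch below only scans for that motif in the first 50 residues of INSP179 (first row of Figure 9). It is an illustration under that assumption and does not reproduce the NetNGlyc 1.0 prediction, which scores sequons rather than merely listing them; `find_sequons` is a hypothetical helper name.

```python
import re

# Asparagine followed by any residue except proline, then serine or threonine.
# The lookahead keeps matches from consuming residues, so overlapping
# sequons are all reported.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_sequons(protein: str) -> list[int]:
    """Return 1-based positions of the asparagine in each N-X-S/T sequon."""
    return [match.start() + 1 for match in SEQUON.finditer(protein.upper())]


FIRST_50 = "MKNLYFRVITIVIGLYFTGIMTNASRKSNILFNSECQWNEYILTNCSFTG"

for position in find_sequons(FIRST_50):
    print(position, FIRST_50[position - 1:position + 2])
```

A sequon is necessary but not sufficient for glycosylation, so positions listed this way are candidates only.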